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Abstract 

We construct a theoretical model for equilibrium distribution of 
workers across sectors with different labor productivity, assuming that 
a sector can accommodate a limited number of workers which depends 
only on its productivity. A general formula for such distribution of pro- 
ductivity is obtained, using the detail-balance condition necessary for 
equilibrium in the Ehrenfest-Brillouin model. We also carry out an em- 
pirical analysis on the average number of workers in given productivity 
sectors on the basis of an exhaustive dataset in Japan. The theoretical 
formula succeeds in explaining the two distinctive observational facts 
in a unihcd way, that is, a Boltzmann distribution with negative tem- 
perature on low-to-medium productivity side and a decreasing part in 
a power-law form on high productivity side. 
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1 Introduction 



The concept of equilibrium plays a central role in economics. Of all, the 
most influential is the Walrasian general equilibrium as represented by Arrow 
and Debreu pQ. Though it is a grand concept, and well established in the 
profession, it cannot be more different from the real economy. The Walrasian 
theory specifies preferences and technologies of all the consumers and firms, 
and defines the equilibrium in which micro-behaviors of all the economic 
agents are precisely determined. It is just as one analyses object such as gas 
comprising many particles by determining the equations of motion for all the 
particles. Physicists know that this approach though it may look reasonable 
at first sight, is actually infeasible and on the wrong track. Instead, following 
the lead of Maxwell, Boltzmann and Gibbs, they have developed statistical 
physics. Curiously, despite of the fact that the macroeconomy consists of 
many heterogeneous consumers and firms, the basic method of statistical 
physics has had almost no impact on economics. 

In the Walrasian equilibrium, the marginal productivities of production 
factors such as labor and capital are equal in all the firms, industries, and 
sectors. Otherwise, there exists inefficiency in the economy, and it contra- 
dicts the notion of equilibrium. However, in the real economy, we actually 
observe significant productivity dispersion. That is, there is a distribution 
rather than a unique level of productivity. Search theory has attempted to 
explain such distribution by considering frictions and search costs which ex- 
ist in the real economy [2JE1I1]. However, it is still based on representative 
agent assumptions [5]. 

To explain equilibrium distribution, the most natural and promising ap- 
proach is to eschew pursuit of precise micro-behavior on representative agent 
assumptions, and resort to the method of statistical physics. Foley [6] is a 
seminal work which applies such statistical method to the general equilib- 
rium model. Yoshikawa [7] argues that the study of productivity dispersion 
provides correct micro- foundations for Keynesian economics, and that to ex- 
plain distribution of productivity we should apply the method of statistical 
physics. In a series of papers, we have attempted to establish the empirical 
distribution using a large data set covering more than a million firms in the 
Japanese manufacturing and non-manufacturing industries [8j El \10\ \TT\ . To 
explain this empirically observed distribution of productivity, Iyetomi j!2] 
introduced the notion of negative temperature. Based on this notion of neg- 
ative temperature, Yoshikawa [5j also made a similar attempt with the help 
of grandcanonical partition function. 

In this paper, we explore the problem from a different angle than the 
standard entropy maximization. Before doing so, we first update our empir- 
ical investigation of distribution of labor productivity in section 2. Most 
of theoretical works exploring distribution of productivity resort to the 
straight-forward entropy maximization. Instead, Scalas and Garibaldi [13J 
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suggest that we study the same problem using the Ehrenfest-Brillouin model, 
a Markov chain which describes random creations and destructions in sys- 
tem comprising many elements moving across a finite number of categories. 
Following their lead, we present such a model in section 3. By considering 
detailed balance, we derive the stationary distribution of the model which 
explains the empirically observed distribution. Section 4 offers brief con- 
cluding remarks. 



2 Distribution of Labor Productivity 

The labor productivity denoted by c, is simply defined by 

Y 

c:=- (1) 
n 

Here, Y is the value added in units of 10 3 yen, and n the number of workers. 
Iyetomi [12] has studied the firm data in Japan for the year 2006. Since 
then, we have obtained the data up to the year 2010 and will use the 2008 
data in this paper, as it contains the largest number of firms. 

Let us briefly review the method of calculating the value added Y. 
Dataset is constructed by unifying two datasets, the Nikkei Economic Elec- 
tric Database (NEEDS) |14| for large firms (most of which are listed) and 
and the Credit Risk Database (CRD) [15] for small to medium firms. The 
value added Y is calculated by the so-called BoJ method, established by the 
Statistics Department of the Bank of Japan, and gives the value added as 
the sum of net profits, labor costs, financing costs, rental expenses, taxes, 
and depreciation costs. Although the original datasets contain over a mil- 
lion firms together, by limiting the analysis to firms which have non-empty 
entries in all these items, we end up with 180,181 firms for 20080 

Figure [T] shows the PDF of the firms' and the worker's log of the labor 
productivity c in units of 10 3 yen/person. The fact that the major peak of 
the latter is shifted to right compared to that of the former indicates that the 
average number of workers per firm n increases in this region. In fact, Fig. [2] 
shows the dependence of n on the labor productivity of the firm (c). We 
observe that as the productivity rises, it first goes up to about n ~ 200 and 
then decreases. Iyetomi |12] explained the upward-sloping distribution in 
the low productivity region by introducing the negative temperature theory. 
The downward-sloping part in the high productivity region is close to linear 
(denoted by the dotted line) in this double-log plot. This indicates that it 
obeys the power low: 

(2) 



lr This covers all the industrial sectors, except for the finance and insurance (95 firms in 
2008), the deep-sea foreign transport of freight (9 firms in 2008), and the holding companies 
(138 firms in 2008), all of which show abnormally high value of the productivity compared 
with other sectors. 
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Figure 1: PDF of the firms' logc (solid lines) and workers' logc 
(dashed lines). The productivity c is in units of 10 3 
yen/person. 




Figure 2: Dependence of the average number of workers n on the 
labor productivity c of firms (dots connected by thick 
lines) The dotted straight line has the gradient —1, that 
is, n oc 1/c. 
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We have studied this phenomenon in the period of 2000 through 2008, 
and not only for all the sectors but also for the manufacturing and the 
non-manufacturing sectors separately. It turns out that we always find the 
qualitatively same pattern as shown in Fig. [21 the number of workers expo- 
nentially increases as c increases up to a certain level of productivity whereas 
it decreases following power law (Eq.([2])) in the high productivity region. We 
thus conclude that this broad shape of distribution of productivity among 
firms is quite robust and universal. We note that this is somewhat counter- 
intuitive in the sense that firms that achieved higher productivity through 
innovation and high-quality management would grow larger, so that equi- 
librium distribution would simply have monotonically increasing n with c. 
Therefore we need to find what is the main reason that causes this behavior, 
which we will do in the following section. 



One way to analyze the equilibrium distribution of labor productivity based 
on statistical physics is to maximize entropy. Instead, Scalas and Garibaldi |13j 
suggest that we can usefully apply the Ehrenfest-Brillouin model, a Markov 
chain to analyze the problem. In this section, we present such a model. 

The macroeconomy consists of many firms with different levels of pro- 
ductivity. Differences in productivity arise from different capital stocks, 
levels of technology and/or demand conditions facing firms. We call a group 
of firms with the same level of productivity a cluster. Workers randomly 
move from a cluster to another for various reasons at various times. De- 
spite of these random changes, the distribution of labor productivity as a 
whole remains stable because those incessant random movements balance 
with each other. This balancing must be achieved for each cluster, and is 
called detail-balance. In the following, we present a general treatment of 
this detail-balance using particle-correlation theory a la [16J. In doing so, 
we make an assumption that the number of workers who belong to clusters 
with high productivity is constrained; see also Ref. [5]. 

We denote the number of workers who belong to a cluster with the level 
of labor productivity, Cj by n^. The total output in the economy as a whole, 
Y is assumed to be equal to aggregate demand D, and is given: 



For the productivity distribution of workers specified by {ni, ri2, n.3, . . . } 
to be in equilibrium, the number of workers who move out of cluster i per 
unit time must be equal to that of workers who move into this cluster per 
unit time. We consider the minimal process that satisfies the condition that 
the total output in the economy as a whole is conserved with Eq. ([3]). In 



3 The Ehrenfest-Brillouin Model 




(3) 
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this process, two workers move simultaneously [13} 117] . This is illustrated 
in Figure [3ja): A worker in a cluster with productivity Cj and a worker in 
a cluster with productivity Cj move to clusters with productivities c k and 
C£, respectively. For the total output Y to remain constant, the following 
condition must be satisfied: 

Ci + Cj = c k + eg. (4) 

Such job-switchings occur for various unspecifiable reasons. The best we 
can do is to consider a Markov chain defined by transition rates, P(i,j; k, £). 
They have the following trivial symmetries: 

P(i,j;k,£)=P(j,i;k,£), P(i,j;k,£) = P(i,j;£,k). (5) 

We also assume that the reverse process, illustrated in Figure E^b) occurs 
with the same probability: 

P{i,j;k,£)=P(k,£;i,j). (6) 

Equilibrium condition then requires the number of workers moving from 
to (k,£) per unit time denoted by N(i, j;k,£) must be equal to the 
number of those from (k,£) to denoted by N(k,£;i,j): 

N(i,j;k,£)=N(k,£;i,j). (7) 

The flux N(i,j; k,£) is proportional to the numbers of workers in clusters % 
and j, rii and rij and the corresponding transition rate P(i,j;k,£): 

N{i,j;k,£) (xP{i,j;k,£)n inj . (8) 

The fundamental assumption we make is that a cluster with productivity 
c can accommodate g(c) workers at most. It means that 

N(i,j;k,£) = P(i,j;k,£)ninjL(c k ,n k )L(ce,nt), (9) 
(a) (b) 




Figure 3: Elementary process where (a) a worker at the firm i 
and a worker at the firm j move to firms k and £, with 
probability P(i,j; k,£) and (b) the reverse process with 
probability P(k,£;i,j). 
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where L(c, n) is a function that limits the number of workers in a cluster 
with productivity c: 

L(c,n)=0 for n>g(c). (10) 

One can obtain a general solution in terms of L(c, n) for the detail- 
balance equation ([7]) in the following way. Thanks to Eq. ([6]), substituting 
Eq. ([9|) into Eq. (|7|) enables us to find 

H( Ci )H( Cj ) = H(c k )H(c e ), (11) 

where H{c) := n/L(c,n). Because of Eq. we obtain 

H{ Ci )H{ Cj ) = H(a + Cj )H(0) (12) 

or, by denoting G(c) := log(H(c)/H(0)), 

G(c i ) + G(c j ) = G(c i + c j ). (13) 

This proves that G(c) is linear in c. It leads us to 

n = L{c,n)e-P {c -^, (14) 

where [i and f3 are real free parameters. Once the function L(c, n) is given, 
the equilibrium distribution n can be obtained by solving the above. 

In order to model the distribution of labor productivity, we need to allow 
g to be any integer number. Furthermore, we find it most natural to choose 
L(c, n) so that it is continuous at n = g(c). Here we adopt a simple linear 
model, 

L(c,n) = %(c)-n)^p, (15) 

as depicted in Fig. HI We can reasonably assume that L(c, 0) = 1, because 
there would be no restrictions for hiring workers if there are none in the 
firm. 
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By substituting Eq. (|15p into Eq. (j!4[) and solving for n, we finally obtain 



- _ 9(c) nfi s 
n ~ g(c)e^)+l [ ' 

This is a simple extension of the Fermi-Dirac statistics. In passing, we note 
that the partition function Z that yields Eq. (|16|) is 

i \ 9(c) 

1 + —-e~^ c ~^ . (17) 
9{c) ) 

It is a reasonable extension of the Fermi-Dirac statistics in the sense that 
the partition function has the expansion 

Z = 1 + e -^ c ~^ + ■■■ (18) 

and yet allows existence of g(c) levels. 



4 Results and Discussion 

First, we note that when there is no limit to the number of the workers, i.e., 
g — > oo, Eq. (|16j) boils down to the Boltzmann distribution, 

n = e -^ c -^. (19) 

When we apply Eq. (|19p to low-to-intermediate range of c where n is an 
exponentially increasing function of c as observed in Fig. [2 we must have 

P < 0, (20) 

the negative temperature. We, therefore, assume that (3 is negative in the 
following. The current model thus incorporates the Boltzmann statistics 
model with negative temperature advanced in Ref. |12j . 

Secondly, we recall the observation that the power law ([2]) holds for n in 
the high productivity side. We can use this empirical fact to determine the 
functional form for g(c). Equation (|16p implies that when temperature is 
negative, n approaches g(c) in the limit c — > oo. These arguments persuade 
us to adopt the following anzatz for g(c), 

g(c) = Ac~\ (21) 

Given the present model, explaining the empirically observed distribu- 
tion of productivity is equivalent to determining four parameters, f3, /i, A, 
and 7 in Eqs. ()16p and (|2ip . We estimate these four parameters by the x 2 
fit to the empirical results as shown in Fig. [5J 

Figure [5] demonstrates the results of the best fit for three datasets of 
firms, namely, those in all the sectors, the manufacturing sector, and the 
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1000 




Figure 5: The best fits to the data; Solid curve is for all sectors, 
the dashed curve for the manufacturing sector, the dot- 
dashed curve for the non-manufacturing sector. 





/3 (xHT 4 ) 


li (xlO 4 ) 


A (xlO 7 ) 


7 


c P (xlO 4 ) 


All 


-1.25 


-2.32 


5.84 


1.18 


3.14 


Manucacturing 


-1.78 


-1.63 


8.51 


1.17 


2.70 


Non-manufacturing 


-0.86 


-3.47 


1.52 


1.08 


3.74 



Table 1: Best-fit parameters and the position of the peak c p . 



non-manufacturing sector. The fitted parameters are listed in Table [TJ The 
present model is quite successful in unifying the two opposing functional 
behaviors of the average number of workers in the low-to-medium and high 
productivity regimes. The crossover takes place at the productivity c p in 
the nonmanufacturing sector which is about 40% as high as that in the 
manufacturing sector. 

Also we see that the inverse temperature /3 of the non-manufacturing 
sector is just half of that of the manufacturing sector. This manifests there 
is a much wider demand gap in the non-manufacturing sector. The economic 
system is thus far away from equilibrium in demand exchange. In contrast, 
(3 times \x gives almost the same values for the two sectors, indicating that 
the system seems to be in equilibrium as regards exchange of workers. These 
findings agree well with those obtained in the previous study |12j . 
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5 Conclusion 



A theoretical model was proposed to account for empirical facts on distribu- 
tion of workers across clusters with different labor productivity. Its key idea 
is to assume that there are restrictions on capacity of workers for clusters 
with high productivity. This is a rational assumption because most of firms 
belonging to such superb clusters are expected to be in a cutting-edge stage 
to lead industry. We then derived a general formula for the equilibrium 
distribution of productivity, adopting the Ehrenfest-Brillouin model along 
with the detail-balance condition necessary for equilibrium. Fitting of the 
model to the empirical results confirmed that the theoretical model could 
encompass both of a Boltzmann distribution with negative temperature on 
low-to-medium productivity side and a decreasing part in a power-law form 
on high productivity side. 
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